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Abstract 

This paper regards the relative locahzation problem in sensor networks. We study a 
randomized algorithm, which is based on input-driven consensus dynamics and involves 
pairwise "gossip" communications and updates. Due to the randomness of the updates, 
the state of this algorithm ergodically oscillates around a limit value. Exploiting the 
ergodicity of the dynamics, we show that the time-average of the state almost surely 
converges to the least-squares solution of the localization problem. Remarkably, the 
computation of the time-average does not require the sensors to share any common 
clock. Hence, the proposed algorithm is fully distributed and asynchronous. 

1 Introduction 

We consider the problem of relative localization for sensor net-works that can be described 
as follo"ws. We assume to have a group of agents representing the nodes of a graph, and a 
vector, indexed over the agents and unkno"wn to them. The agents are allo"wed to take noisy 
measurements of the differences bet"ween their vector entries and those of their neighbors 
in the graph. The estimation problem consists in reconstructing the original vector, up to 
an additive constant. While an optimal solution can be easily found by a centralized least- 
squares approach, we are interested in finding effective distributed solutions. More precisely, a 
solution is said to be distributed if it requires each node to use information "which is available 
only at the node itself or from its immediate neighbors. Follo"wing this approach, we have 
recently proposed [T] a randomized "gossip" algorithm for distributed relative localization. 
This algorithm, "which is inspired by a gradient descent approach, involves the activation of 
a randomly chosen pair of neighboring nodes at each time step. In our previous "work, we 
have already given a convergence result for the algorithm: the mean-square error bet"ween 
the time-average of the states and the optimal solution asymptotically goes to zero. 
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In this paper, we study the algorithm using tools from ergodic theory and we obtain 
a related convergence result: the time-average of the states converges almost surely to the 
optimal solution. Significantly, our definition of time-average does not require the agents to 
be aware of any global clock or of any global variable which counts the number of interactions 
occurring on the network. 

1.1 State of the art 

The abstract problem of relative localization is deemed to have important applications in 
sensor and robotic networks [2] • The proposed applications cover synchronization of uncertain 
clocks [3], as well as spacial localization [4J in mobile robotic networks when no absolute 
position information is available. In the control literature, the problem has been popularized 
by [51 [SI 1^ . Distributed algorithms involving synchronized updates by the nodes have been 
proposed in the last few years [H [3] . The latter paper uses a gradient-descent technique to 
solve the problem: this approach has been later followed in [3 [H] , as well as in our previous 
work [1]. Recently, papers have started to consider randomized asynchronous and randomized 
algorithms to solve the localization problem [9j [TOl [H] , see [12] for details about randomized 
algorithms. 

1.2 Contribution 

In this work, we prove an ergodic theorem for the algorithm in Ti Eq. (10)] and we show that a 
suitable time- averaging operation removes the persistent random oscillations which affect the 
"raw" estimates obtained through gossip communication and updates. The resulting time- 
averaged state converges almost surely to the optimal solution of the relative localization 
problem. 

The ergodicity analysis of seemingly non-convergent dynamics started to attract atten- 
tion in the theory of multi-agent systems and distributed control only recently. For exam- 
ple, we remark that the algorithm under study presents strong similarities with randomized 
algorithms for the PageRank computation, which have been recently proposed in several 
papers [HI [TH [15] . Indeed, randomized PageRank dynamics have been shown to converge 
modulo time-averaging: available results cover both convergence in mean square [13j and 
almost sure [T5] . We stress, however, that our proof of almost sure convergence is based on 
completely different tools from |16j . which uses techniques from stochastic approximation. 
Related applications of ergodic theory can be found in the context of social networks [17l [18] , 
where the authors show the ergodicity of specific opinion dynamics, which extend the well- 
known consensus dynamics to incorporate external influences and heterogeneous (stubborn) 
agents. Our analysis owes much to these techniques. 

Furthermore, we stress that ergodicity is a key property in enabling our framework 
to easily accommodate time- averages. Hence, we study an algorithm which is fully asyn- 
chronous and distributed. On the contrary, prior work on randomized algorithms assumed 
time-averages to be computed using a global iteration counter, see [l3[ 116], 
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1.3 Organization of the paper 

We formally present the problem of relative localization in Section [2j Then, in Section |3] we 
define the randomized algorithm and state the main convergence results. These results are 
proved by the analysis presented in Section |4] We conclude the paper with some remarks on 
future research in the final section. 

2 The problem of relative localization 

We consider a set of nodes V of cardinality N, endowed with an unknown scalar quantity Xy 
for V gV. The relative localization problem consists, for each node u gV, in estimating the 
scalar value Xu, based on noisy measurements of differences x„ — Xy with certain neighbors 
V. An oriented graph Q = (V,£) is used to represent the available measurements. The 
orientation of the pairs is conventionally assumed to be such that {v,u) € £ only ii u < v. 
We let A G {0, ztl}^'^^ be the incidence matrix of the graph which is defined as 

{+1 if e = (tj, w) 
-1 if e = {w, v) 
otherwise 

for every e G £. We let 6 e be the vector collecting the measurements 

b — Ax + f], 

where 77 G is random noise with £[77] =0 and E[7777^] = <J^I, and / is the identity matrix. 
We define the set of the optimal estimates in a least squares sense as 

X = argmin lIAz - 6II2 (1) 
where || • II2 is the Euclidean norm. The set X is described in the following well-known lemma 

m- 

Lemma 1 (Centralized solution). Let the graph Q be connected and let L :— AJ A denote 
the Laplacian of the graph. The following facts hold: 

1. X £ X if and only if A'^ Ax — A^b; 

2. there exists a unique x* G X such that \\x*\\2 = niin^^x Iklb/ 

3. X* = L^A^b, where denotes the Moore-Penrose pseudo-inverse of the Laplacian L. 

Note that x* is the minimum-norm element of the afhne space of solutions of ([T]) . Indeed, 
Ax* — A{x* -I- cl) for all scalar c, where 1 denotes the vector of ones. 

In this work, we are interested in designing randomized asynchronous algorithms to solve 
this problem: in view of the result above, we shall assume from now on that Q is connected. 
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3 Algorithm description and main results 



This section is devoted to describe the algorithm which was proposed in (TJ Section V] and 
to state our main results. 

The algorithm involves for each node w € V a triple of states {xy, depending on a 

discrete time index k E Z>o: these three variables play the following roles: a;„(fc) is the "raw" 
estimate of Xy obtained by v at time k through communications with its neighbors, Kvik) 
counts the number of updates performed by v up to time fc, and Xy{k) is the "smoothed" 
estimate obtained through time-averaging. The algorithm is defined by choosing a scalar 
parameter 7 S (0,1) and a sequence of random variables ^(fc)fcgz>o taking values in £. At 
each time k, provided that 9{k) = {u,v), the state updates are performed according to the 
following rules: the estimates evolve as 

Xuik + 1) = (1 - 7)x„(fc) + 7X^,(fc) + 76(„,„) 

Xy{k + 1) = (1 - -f)xy{k) + -fXu{k) ~ 76(„_„) (2a) 
Xw{k + 1) = Xjy{k) ifw^{u, w}; 

the local times as 



(2b) 





-1) 


= Ku{k) 




-1) 


= Ky (fc) 




-1) 


= Kyj (fc) 



ii w ^ {u, v}; 
and the time-averages as 

Xu{k + 1) = — , (Ku{k)xu{k) + Xu{k + 1)) 

Ka{k+l)' 



Xy{k + 1) = ———(Ky{k)Xy{k) + X y {k + l)) 

Ky{k+1)^ 

Xyj{k + 1) ^ Xyj{k) iiw^{u,v}. 



(2c) 



We assume the sequence {6{k)}kez>o to be i.i.d., and its probability distribution to be 
uniform, i.e., 

V[0{k)^{u,v)] = ^^, VfceZ>o, (3) 

where \£\ denotes the cardinality of £. Note that this choice is made without loss of generality 
and convenience: the same approach may accommodate other distributions, as required by 
the application. 

It should be noted that the time index k in fact counts the number of updates which 
have occurred in the network, whereas for each u G V the variable Ku(fc) is the number of 
updates involving u up to the current time. Hence, is a local variable which is inherently 
known to agent u, even in case the common clock k is unavailable. This algorithm is totally 
asynchronous and fully distributed, in the sense that the time averaging process does not 
need the nodes to be aware of a common clock. This point provides a major improvement 
with respect to our previous work [1] Theorem 3]. 
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The dynamics i n (pa| ) oscillates persistently and fails to converge in a deterministic sense, 
as shown in Figure [Ij However, the oscillations concentrate around the solution of the least 
squares problem, as it is formally stated in the following two results. The first result regards 
the behavior of the average dynamics. 



Proposition 2 (Convergence in expectation). Consider the dynamics (2a) with uniform 
edge selection ([3|. Then, 

lim W.[x{k)] = X*. 

The second result, instead, states that the sample dynamics is well-represented by the 
average one, i.e., the process is ergodic. 



Theorem 3 (Almost sure convergence of ergodic means). The dynamics in (2a I, with uni- 
form edge selection ([3| and x{0) = 0, is ergodic. // {nij^gN is a sequence of nonnegative 
integers, then with probability one 

1 ^ 

lim — > x(nf)=x* . 
1=1 

From these two facts, which are proved in the next section, we immediately deduce the 
following statement, which motivates the definition of the algorithm previously introduced 
indicates that Xu{k) is the right variable" to approximate the optimal estimate x*. 



Corollary 4. The dynamics in ( 2c ) is such that 



lim x{k) 



with probability one. 



4 Ergodicity analysis 

In this section we study the convergence properties of the vector x{k) for the update model 
described in (2a). As shown in Fig. [l| the estimate of each agent oscillates persistently; on 



the other hand the time averages approach the optimal solution x* in Lemma [T] Although 
the process x{k) almost surely fails to converge, we prove that it converges in distribution to 
a random variable Xoo and is ergodic. 

To begin, we rewrite the dynamics of (|2a|) as 



x{k + \)^Q{k)x{k) + y{k) (4) 

where 

Q{k) = / - 7(e„ - e„)(e„ - e„)^, 
vector Cu denotes the vector of the canonical basis corresponding to u, and 

y{k) = be{k){eu - e„) 

provided 0{k) = {u,v) with k G Z>o. Consequently, we prove the following result either by 
direct computation or by using Lemma 5 in [1]. 
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Lemma 5. For the distribution ([3|, it holds 

E[Q(fc)]=/-7^, E[y(fc)]=7^- 

We note that for all k the matrix Q{k) is doubly stochastic and the sum of the elements 
in y{k) is zero, that is, 

l^g(fc) = l, Q{k)l^l, l^y(fc) = 0. (5) 

In particular, if the vector a;(0) is initialized to zero, then 1^ x{k) — for each k G Z>o. 

These observations imply that the dynamics of x{k) is equivalently described by the 
iterate 

x{k + l) = Pik)x{k) + y{k), (6) 

where P(fc) = Q{k){I - ^11^) = (/ - ^ll^)(3(fc) is a projection of Q{k) outside the 
"consensus sub-space" spanned by 1. This rewriting is instrumental to study the convergence 
behavior of the process {x{k)}kei.>o, by asymptotic techniques of iterated random functions, 
which we recall from [TH]. These techniques require, in order to study the random process ([6]), 
to consider the associated backward process '^{k), which we define below. 

For any time instant fc, consider the random matrices P{k) and y{k) and define the matrix 
product 

l^{e, m) P{m)P{m -1)---P{£+ 1)P(£) (7) 
with £ £ {0, . . . , m}. Then, the iterated affine system in Q can be rewritten as 

x{k + l) ^l^{0,k)x{0)+ J2 ~^{i + l.k)y{i). (8) 

0<f<fc 

The corresponding backward process is defined by 

lF(fc + i) = ^(o,fc)x(o)+ ^(o,^-i)2;W, (9) 

0<l<k 



where 

^(^, m) P{l)P{l + 1) • • • P(m - l)P{m) (10) 

with £ S {0, . . . Crucially, the backward process Izr(fc) has at every time k S Z>o the 
same probability distribution of x{k). The main tool to study the backward process is the 
following well-known result. 

Lemma 6 (Theorem 2.1 in |19j). Let us consider the Markov chain {^(fc)}fcgN defined by 

X{k + l)= A{k)X{k) + B{k) keZ>o 
where A{k) £ M^^^ and B{k) £ are i.i.d. random variables. Let us assume that 

E[log \\A{k)\\] < oo E[log \\B{k)\\] < oo. (11) 



6 



The corresponding backward random process X{k) converges a.s. to a finite limit Xqo if and 
only if 

(12) 



inf -E[log||A(l)...A(fc)||] < 0. 



If (12 1 holds, the distribution of X^o is the unique invariant distribution for the Markov 
chain X{k). 

This result provides conditions for the backward process to converge to a hmit. Although 
the forward process has a different behavior compared to the backward process, the forward 
and backward processes have the same distribution. This fact allows us to determine, by 
studying the backward process Ir (fc), whether the sequence of random variables {x(fc)}fcgz>o 
converges in distribution to the invariant distribution of the Markov chain in 

This analysis is done in the following result. 

Lemma 7. Consider the random process x{k) defined in where Pik) and y{k) are i.i.d.. 
Then ^{k) converges a.s. to a finite limit Xrx,, and the distribution of is the unique 
invariant distribution for x{k). 

Proof. In order to apply Lemma |6j let us compute 



inf 

feeN k 



log||P(0,A:-l)||i 



< inf ilogE [||P(0,fc- 1) 
fceN fc L 



inf - log E 

km k 



< inf - logE 

feGN k 



maxV(^(0,fc-l))„, 

wGV ^ — ^ 



^^(P(0,fc-1)), 
.wev vev 



< inf i log y y E [^(0, fc - 1) 
- V 

[V(0,fc-1) 



wev vev 



< inf - log ( n E 
fceN fc V II 



inf - log n 
fceN fc \ 



fc-i 



nE[F(£)] 



Let q be the number of distinct eigenvalues of E[P(fc)], denoted as and con- 



sider the Jordan canonical decomposition E [P(fc)] — UJU ^. Then 



< 
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||f^lloo|l>^'^lloo|lC^ ""^lloo- Since the fc-th power of the Jordan block of size s is 



A 1 
A 1 







A 1 



••• A_ 







A" 



( ^jA^-^+i- 



A'= 



we deduce that 



S£ — l 



oo = max 
vev 



\ k—m 



^(J'=),„= max ^A* 

w^V m— 

where Si is the size of the largest Jordan block corresponding to A^. Then 

Sl — l 



II J* 



< 



max lAfl'' ^ |A£ 



m=0 
s^ — l 



m 



< max lAfl'^fc" J2 l^^r™ 
where x is a constant independent of k and 

p = \\npm\\2- 

From Lemma [5) it follows that p < I. We conclude that there exists a constant C 
l|t^l|oo||C^~"'^||ooX: independent of fc, such that 



E 



log||>(0,fc- l)||i < log(nCp'=fc"^ 



and, consequently, 



inf -E 

feeN k 



< lim 



log||P(0,fc-l)||i 
log(CnA;"p'=) 



log p < 0. 



The claim then follows from Lemma [6l 



(13) 



□ 
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As a consequence, we deduce that also the (forward) random process x{k), defined in 
converges in distribution to a limit Xoo, and the distribution of x^o is the unique invariant 
distribution for x{k). The oscillations of (2a) are ergodic, as stated in the following result. 

Lemma 8. The random process x{k) defined in ([6| is ergodic; if {n^ j^gpj is a sequence of 
nonnegative integers, then with probability one 



1 ^ 

lim —y x{nt)—K[xa 
k^oo k ^ — ' 



Furthermore, W^Xryo] — A^b. 

Proof. We begin by showing the ergodicity. Let z(0) be a random vector independent from 
a;(0) with the same distribution as Xoo- Let {z(fc)}fegz>o be the sequence such that 

z{k) = l^{0,k-l)z{0)+ l^{£+l,k-l)y{£) 

o<e<k-i 

where 'P(£ + 1, fc — 1) is defined as in Q. It should be noted that, since the process z{k) 
is stationary, we can apply the law of large numbers and immediately conclude that, with 
probability one, 

fe-i 



lim — z(s) = Eb 



On the other hand, we compute 

P(||a;(fc)-z(fc)|li >e'=) 

E[|li^(0,fc-l)(z(0)-x(0))|li 



< 



E 



< 



|7^(o,fc-i)||i||z(o)-x(o))|| 



E 



< 



|7^(0,fc-l)||ilE[||2(0)-a;(0)|| 



< 



E[||z(0) - a;(0)|| 



where we have used (13 1. If we choose e e (p, 1), then the Borel-Cantelli Lemma [201 
Theorem 1.4.2] implies that with probability one ||a;(A:) — < for all but finitely 

many values of fc > 0. Therefore, almost surely ^ X]s=o II ^(■5) ~ ^(■5)11 1 converges to zero as 
fc goes to infinity, 

^ k-l 

lim - x{s) = E[a;oo]. 
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The statement follows when we observe that the same arguments hold if we replace all sum- 
mations over the nonnegative integers with summations over a subsequence of nonnegative 
integers. 
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To complete the proof, we only need to compute the expectation of Xao- Using the 
independence among P(fc)s and y{k)s, we obtain 

E[x{k)] = ^Pfx{0) + J2 HP{k)f~'^~^Hy{k)] 

o<e<k-i 

and we conclude from Proposition [2] (see also p/i, Proposition 6]) that 

E[a;oo] = lim E[x(k)] = L^A^b. 

□ 

5 Concluding remarks 

In this paper, we have studied a randomized gossip algorithm for the relative localization 
problem, which is distributed and asynchronous. Using tools from ergodic theory, we have 
shown the almost sure convergence of the algorithm to the optimal solution. Future research 
will seek broader applications of these techniques in multi-agents systems. 
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Figure 1: Complete graph, = 20, 7 = 0.1, zero initial conditions. 
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